Supervised Learning of a Probabilistic Lexicon of Verb Semantic Classes
نویسندگان
چکیده
The work presented in this paper explores a supervised method for learning a probabilistic model of a lexicon of VerbNet classes. We intend for the probabilistic model to provide a probability distribution of verb-class associations, over known and unknown verbs, including polysemous words. In our approach, training instances are obtained from an existing lexicon and/or from an annotated corpus, while the features, which represent syntactic frames, semantic similarity, and selectional preferences, are extracted from unannotated corpora. Our model is evaluated in type-level verb classification tasks: we measure the prediction accuracy of VerbNet classes for unknown verbs, and also measure the dissimilarity between the learned and observed probability distributions. We empirically compare several settings for model learning, while we vary the use of features, source corpora for feature extraction, and disambiguated corpora. In the task of verb classification into all VerbNet classes, our best model achieved a 10.69% error reduction in the classification accuracy, over the previously proposed model.
منابع مشابه
Computational Lexicography and Lexicology A Large-Scale Extension of VerbNet with Novel Verb Classes
Lexical classifications have proved useful in supporting various linguistic and natural language processing (NLP) tasks. The largest verb classification in English is Levin's (1993) work. VerbNet (Kipper-Schuler 2006) the largest computational verb lexicon currently available for English provides detailed syntactic-semantic descriptions of Levin classes. While the classes included are extensive...
متن کاملLearning Frames from Text with an Unsupervised Latent Variable Model
We develop a probabilistic latent-variable model to discover semantic frames—types of events or relations and their participants—from corpora. Our key contribution is a model in which (1) frames are latent categories that explain the linking of verb-subject-object triples in a given document context; and (2) cross-cutting semantic word classes are learned, shared across frames. We also introduc...
متن کاملرشد جنبه معنایی فعل در کودک فارسیزبان: مطالعه طولی
Objective Learning “verb” as one of the main components of sentence, has been always a debatable topics in the process of language learning. One of the important issues in “verb” learning is determining its meaning using syntactic clues and learning its semantic aspects. Therefore, the main objective of this study was to examine the development of the semantic aspect of ...
متن کاملA Supervised Algorithm for Verb Disambiguation into VerbNet Classes
VerbNet (VN) is a major large-scale English verb lexicon. Mapping verb instances to their VN classes has been proven useful for several NLP tasks. However, verbs are polysemous with respect to their VN classes. We introduce a novel supervised learning model for mapping verb instances to VN classes, using rich syntactic features and class membership constraints. We evaluate the algorithm in both...
متن کاملClass-Based Construction of a Verb Lexicon
We present an approach to building a verb lexicon compatible with WordNet but with explicitly stated syntactic and semantic information, using Levin verb classes to systematically construct lexical entries. By using verb classes we capture generalizations about verb behavior and reduce the effort needed to construct the lexicon. The syntactic frames for the verb classes are represented by a Lex...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009